Search CORE

17 research outputs found

Computational cluster validation for microarray data analysis: experimental assessment of Clest, Consensus Clustering, Figure of Merit, Gap Statistics and Model Explorer

Author: A Alizadeh
A Ben-Hur
A Jain
A Kapp
AD Gordon
AK Jain
B Everitt
B Mirkin
CV Rijsbergen
Davide Scaturro
E Fowlkes
E Hartuv
Filippo Utro
GJ McLachlan
GW Milligan
I Priness
J Handl
JA Hartigan
JA Rice
JN Breckenridge
KY Yeung
L Hubert
L Kaufman
M Yan
P Hansen
PT Spellman
R Shamir
R Tibshirani
Raffaele Giancarlo
S Datta
S Dudoit
S Monti
T Hastie
V Di Gesú
W Krzanowski
X Wen
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

This is an Open Access article distributed under the terms of the Creative Commons Attribution Licens

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Archivio istituzionale della ricerca - Università di Palermo

Network Inference Algorithms Elucidate Nrf2 Regulation of Mouse Lung Oxidative Stress

Author: A Jacquier
A Otomo
AA Margolin
AA Margolin
AK Jaiswal
AK Jaiswal
B Biteau
C-C Chang
CJ Reed
CM Clements
CO Daub
D Giustarini
Deepti Malhotra
DJ Moore
EY Park
George Acquaah-Mensah
GK Acquaah-Mensah
H Cai
H Ohkawa
I Nagano
I Priness
I Rahman
IH Witten
J Choi
JJ Faith
K Basso
K Itoh
K Iwasaki
M Kanehisa
M Matsuoka
M Singhal
MM Gallogly
Mudita Singhal
N Christianni
N Slonim
N Watanabe
P Shannon
PL Whitney
R Venugopal
R Venugopal
RA Irizarry
RC Taylor
RC Taylor
RG Will
RK Thimmulappa
Ronald C. Taylor
Ruth Nussinov
S Hadano
S Mead
SE Keene
Shyam Biswal
T Rangasamy
TM Cover
U Alon
U Alon
V Bonifati
VJ Findlay
W Droge
W Zhou
WW Wasserman
XL Chen
Y El-Manzalawy
Y Katoh
Y Li
Publication venue: Public Library of Science
Publication date: 01/08/2008
Field of study

A variety of cardiovascular, neurological, and neoplastic conditions have been associated with oxidative stress, i.e., conditions under which levels of reactive oxygen species (ROS) are elevated over significant periods. Nuclear factor erythroid 2-related factor (Nrf2) regulates the transcription of several gene products involved in the protective response to oxidative stress. The transcriptional regulatory and signaling relationships linking gene products involved in the response to oxidative stress are, currently, only partially resolved. Microarray data constitute RNA abundance measures representing gene expression patterns. In some cases, these patterns can identify the molecular interactions of gene products. They can be, in effect, proxies for protein–protein and protein–DNA interactions. Traditional techniques used for clustering coregulated genes on high-throughput gene arrays are rarely capable of distinguishing between direct transcriptional regulatory interactions and indirect ones. In this study, newly developed information-theoretic algorithms that employ the concept of mutual information were used: the Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNE), and Context Likelihood of Relatedness (CLR). These algorithms captured dependencies in the gene expression profiles of the mouse lung, allowing the regulatory effect of Nrf2 in response to oxidative stress to be determined more precisely. In addition, a characterization of promoter sequences of Nrf2 regulatory targets was conducted using a Support Vector Machine classification algorithm to corroborate ARACNE and CLR predictions. Inferred networks were analyzed, compared, and integrated using the Collective Analysis of Biological Interaction Networks (CABIN) plug-in of Cytoscape. Using the two network inference algorithms and one machine learning algorithm, a number of both previously known and novel targets of Nrf2 transcriptional activation were identified. Genes predicted as novel Nrf2 targets include Atf1, Srxn1, Prnp, Sod2, Als2, Nfkbib, and Ppp1r15b. Furthermore, microarray and quantitative RT-PCR experiments following cigarette-smoke-induced oxidative stress in Nrf2+/+ and Nrf2−/− mouse lung affirmed many of the predictions made. Several new potential feed-forward regulatory loops involving Nrf2, Nqo1, Srxn1, Prdx1, Als2, Atf1, Sod1, and Park7 were predicted. This work shows the promise of network inference algorithms operating on high-throughput gene expression data in identifying transcriptional regulatory and other signaling relationships implicated in mammalian disease

Crossref

Directory of Open Access Journals

PubMed Central

MCAM: Multiple Clustering Analysis Methodology for Deriving Hypotheses and Insights from High-Throughput Proteomic Datasets

Author: A Jain
A Wolf-Yadlin
AG Batzer
AQ Emili
B Nolen
BA Babbin
BA Joughin
BJ Frey
C Badowski
C Choudhary
Douglas A. Lauffenburger
E Darnell J J
ET Bowden
F Attanasio
F Diella
Forest M. White
I Priness
J McCallum
J Saez-Rodriguez
J Shi
JA Cooper
Jason A. Papin
JV Olsen
K Azuma
K Tashiro
KM Naegle
Kristen M. Naegle
KS Ravichandran
M Gensler
M Mann
M Oser
MA Davis
MB Eisen
MB Yaffe
Michael B. Yaffe
MJ Hayes
NG Oberprieler
P D'Haeseleer
P Tamayo
R Giancarlo
RA van den Berg
Roy E. Welsch
S Feo
S Tavazoie
SK Mitra
T Hitosugi
T Kohonen
X Li
Y Benjamini
Y Yarden
Y Zhang
Y Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/02/2011
Field of study

Advances in proteomic technologies continue to substantially accelerate capability for generating experimental data on protein levels, states, and activities in biological samples. For example, studies on receptor tyrosine kinase signaling networks can now capture the phosphorylation state of hundreds to thousands of proteins across multiple conditions. However, little is known about the function of many of these protein modifications, or the enzymes responsible for modifying them. To address this challenge, we have developed an approach that enhances the power of clustering techniques to infer functional and regulatory meaning of protein states in cell signaling networks. We have created a new computational framework for applying clustering to biological data in order to overcome the typical dependence on specific a priori assumptions and expert knowledge concerning the technical aspects of clustering. Multiple clustering analysis methodology (‘MCAM’) employs an array of diverse data transformations, distance metrics, set sizes, and clustering algorithms, in a combinatorial fashion, to create a suite of clustering sets. These sets are then evaluated based on their ability to produce biological insights through statistical enrichment of metadata relating to knowledge concerning protein functions, kinase substrates, and sequence motifs. We applied MCAM to a set of dynamic phosphorylation measurements of the ERRB network to explore the relationships between algorithmic parameters and the biological meaning that could be inferred and report on interesting biological predictions. Further, we applied MCAM to multiple phosphoproteomic datasets for the ERBB network, which allowed us to compare independent and incomplete overlapping measurements of phosphorylation sites in the network. We report specific and global differences of the ERBB network stimulated with different ligands and with changes in HER2 expression. Overall, we offer MCAM as a broadly-applicable approach for analysis of proteomic data which may help increase the current understanding of molecular networks in a variety of biological problems.National Institutes of Health (U.S.) (NIH-U54-CA112967 )National Institutes of Health (U.S.) (NIH-R01-CA096504

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central